Understanding Dropout
نویسندگان
چکیده
Dropout is a relatively new algorithm for training neural networks which relies on stochastically “dropping out” neurons during training in order to avoid the co-adaptation of feature detectors. We introduce a general formalism for studying dropout on either units or connections, with arbitrary probability values, and use it to analyze the averaging and regularizing properties of dropout in both linear and non-linear networks. For deep neural networks, the averaging properties of dropout are characterized by three recursive equations, including the approximation of expectations by normalized weighted geometric means. We provide estimates and bounds for these approximations and corroborate the results with simulations. Among other results, we also show how dropout performs stochastic gradient descent on a regularized error function.
منابع مشابه
Dropping out and a crisis of trust
This article focuses on the high dropout rate in the Danish vocational education system. The article seeks to analyse the dropout problem using the concept of trust. By taking this perspective on the dropout problem, the article seeks to replace the hitherto-dominant deficit of understanding within the Danish dropout research, which has focused on the so-called crisis of motivation in which stu...
متن کاملA Non-Random Dropout Model for Analyzing Longitudinal Skew-Normal Response
In this paper, multivariate skew-normal distribution is em- ployed for analyzing an outcome based dropout model for repeated mea- surements with non-random dropout in skew regression data sets. A probit regression is considered as the conditional probability of an ob- servation to be missing given outcomes. A simulation study of using the proposed methodology and comparing it with a semi-parame...
متن کاملA Comparative Review of Selection Models in Longitudinal Continuous Response Data with Dropout
Missing values occur in studies of various disciplines such as social sciences, medicine, and economics. The missing mechanism in these studies should be investigated more carefully. In this article, some models, proposed in the literature on longitudinal data with dropout are reviewed and compared. In an applied example it is shown that the selection model of Hausman and Wise (1979, Econometri...
متن کاملABSTRACT Title of Thesis: SCHOOL DROPOUT AND SUBSEQUENT OFFENDING: DISTINGUISHING SELECTION FROM CAUSATION
Title of Thesis: SCHOOL DROPOUT AND SUBSEQUENT OFFENDING: DISTINGUISHING SELECTION FROM CAUSATION Gary Allen Sweeten, Master of Arts, 2004 Thesis directed by: Professor Shawn D. Bushway Department of Criminology and Criminal Justice Past research on the relationship between school dropout and offending is inconclusive. In explaining their findings, researchers have focused on strain and control...
متن کاملSurprising properties of dropout in deep networks
We analyze dropout in deep networks with rectified linear units and the quadratic loss. Our results expose surprising differences between the behavior of dropout and more traditional regularizers like weight decay. For example, on some simple data sets dropout training produces negative weights even though the output is the sum of the inputs. This provides a counterpoint to the suggestion that ...
متن کامل